# Low-latency speech synthesis
Llama3.1 Typhoon2 Audio 8b Instruct
Typhoon 2-Audio Edition is an end-to-end speech-to-speech model architecture capable of processing audio, speech, and text inputs while simultaneously generating both text and speech outputs. The model is specifically optimized for Thai language while also supporting English.
Text-to-Audio
Transformers Supports Multiple Languages

L
scb10x
664
9
Mms Spa Finetuned Colombian Monospeaker
This is a Spanish TTS model based on MMS, fine-tuned using the VITS architecture, requiring only 80-150 samples and 20 minutes of training time to generate Spanish speech with a Colombian accent.
Speech Synthesis
Transformers Spanish

M
ylacombe
71
1
Mms Spa Finetuned Argentinian Monospeaker
This is a fine-tuned model based on the MMS Spanish version, built using the VITS architecture, trained with only 80 to 150 samples in approximately 20 minutes.
Speech Synthesis
Transformers Spanish

M
ylacombe
88
3
Featured Recommended AI Models